Precise Computer Performance Comparisons Via Statistical Resampling Methods

نویسندگان

  • Bin Li
  • Shaoming Chen
چکیده

Performance variability, stemming from nondeterministic hardware and software behaviors or deterministic behaviors such as measurement bias, is a well-known phenomenon of computer systems which increases the difficulty of comparing computer performance metrics. Conventional methods use various measures (such as geometric mean) to quantify the performance of different benchmarks to compare computers without considering variability. This may lead to wrong conclusions. In this paper, we propose three resampling methods for performance evaluation and comparison: a randomization test for a general performance comparison between two computers, bootstrapping confidence estimation, and an empirical distribution and five-number-summary for performance evaluation. The results show that 1) the randomization test substantially improves our chance to identify the difference between performance comparisons when the difference is not large; 2) bootstrapping confidence estimation provides an accurate confidence interval for the performance comparison measure (e.g. ratio of geometric means); and 3) when the difference is very small, a single test is often not enough to reveal the nature of the computer performance due to the variability of computer systems. We further propose using empirical distribution to evaluate computer performance and a five-number-summary to summarize computer performance. We illustrate the results and conclusion through detailed Monte Carlo simulation studies and real examples. Results show that our methods are precise and robust even when two computers have very similar performance metrics. Keywords— Performance of Systems; Performance attributes; Measurement, evaluation, modeling, simulation of multipleprocessor systems; Experimental design

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computer-intensive methods in the analysis of species-habitat relationships

Statistical habitat models give quantitative descriptions of species-habitat relationships by analysing records of environmental conditions and species incidence by means of logistic regression or other techniques. The most useful model in a given situation is usually unknown a priori. Several data-driven, computer-intensive methods exist to automatically fit a large number of candidate models ...

متن کامل

The Jackknife Estimation Method Avery I

Statistical resampling methods have become feasible for parametric estimation, hypothesis testing, and model validation now that the computer is a ubiquitous tool for statisticians. This essay focuses on the resampling technique for parametric estimation known as the Jackknife procedure. To outline the usefulness of the method and its place in the general class of statistical resampling techniq...

متن کامل

Bootstrapping Conic Multivariate Adaptive Regression Splines (Bcmars)

Bootstrapping is a computer-intensive statistical method which treats the data set as a population and draws samples from it with replacement. This resampling method has wide application areas especially in mathematically intractable problems. In this study, it is used to obtain the empirical distributions of the parameters to determine whether they are statistically significant or not in a spe...

متن کامل

The Jackknife Estimation Method

Statistical resampling methods have become feasible for parametric estimation, hypothesis testing, and model validation now that the computer is a ubiquitous tool for statisticians. This essay focuses on the resampling technique for parametric estimation known as the Jackknife procedure. To outline the usefulness of the method and its place in the general class of statistical resampling techniq...

متن کامل

Comparison of Computer Vision and Photogrammetric Approaches for Epipolar Resampling of Image Sequence

Epipolar resampling is the procedure of eliminating vertical disparity between stereo images. Due to its importance, many methods have been developed in the computer vision and photogrammetry field. However, we argue that epipolar resampling of image sequences, instead of a single pair, has not been studied thoroughly. In this paper, we compare epipolar resampling methods developed in both fiel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015